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Abstract. We investigate the surjectivity of the word map defined by the n-th Engel word 
on the groups PSL(2, q) and SL(2, q). For SL(2, q), we show that this map is surjective onto 
the subset SL(2, q)\{—id} C SL(2, q) provided that q > qo(n) is sufficiently large. Moreover, 
we give an estimate for qo(n). We also present examples demonstrating that this does not 
hold for all q. 

We conclude that the n-th Engel word map is surjective for the groups PSL(2,g) when 
q > qo(n). By using the computer, we sharpen this result and show that for any n < 4, the 
corresponding map is surjective for all the groups PSL(2,q). This provides evidence for a 
£SJ ' conjecture of Shalev regarding Engel words in finite simple groups. 

In addition, we show that the n-th Engel word map is almost measure preserving for the 
family of groups PSL(2,g), with q odd, answering another question of Shalev. 

Our techniques are based on the method developed by Bandman, Grunewald and Kun- 
00 ' yavskii for verbal dynamical systems in the group SL(2, <?). 



1. Introduction 

1.1. Word maps in finite simple groups. During the last years there was a great interest 
in word maps in groups (for an extensive survey see [Se] ) . These maps are defined as follows. 
Let w = w(xi, . . . , Xd) be a non-trivial group word, namely a non-identity element of the free 
group Fd on X\, . . . , Xd- Then we may write w = x™*x?* . . . x™ fc where 1 < ij < d, rij G Z, 
and we may assume further that w is reduced. Let G be a group. For gi, . . . , g d we write 

O^: w( gi ,...,g d )=gZg%...g%eG, 
and define 

w(G) = {w(g u ...,g d ):gi,...,g d &G}, 

as the set of values of w in G. The corresponding map w : G d — > G is called a word map. 

It is interesting to estimate the size of w(G). Borel |Bo] showed that the word map induced 
by w 7^ 1 on simple algebraic groups is a dominant map. Larsen [La] used this result to 
show that for every non-trivial word w and e > there exists a number C(w, e) such that if 
G is a finite simple group with \G\ > C(w,e) then > | | 1 e . By a celebrated result 

of Shalev ph09] one has that for every non-trivial word w there exists a constant C (w) such 
that if G is a finite simple group satisfying \G\ > C(w) then w(G) 3 = G. These results were 
substantially improved by Larsen and Shalev |LSj for various families of finite simple groups, 
and have recently been generalized by Larsen, Shalev and Tiep |LST] . 

One can therefore ask whether w(G) = G for any non-trivial word w and all finite simple 
non-abelian groups G. The answer to this question is clearly negative. It is easy to see that 
if G is a finite group and m is an integer which is not relatively prime to the order of G 
then for the word w = x™ one has that w(G) ^ G. Hence, if v 6 Fd is any word, then 
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the word map corresponding to w = v m cannot be surjective. A natural question, suggested 
by Shalev, is whether these words are generally the only exceptions for non-surjective word 
maps in finite simple non-abelian groups. In particular, the following conjecture was raised: 

Conjecture 1.1 (Shalev). |Sh07j Conjectures 2.8 and 2.9]. Let w ^ 1 be a word which is 
not a proper power of another word. Then there exists a number C(w) such that, if G is 
either A r or a finite simple group of Lie type of rank r, where r > C(w), then w(G) = G. 

It is now known that for the commutator word w = [x,y] £ F 2 , one has that w(G) = G 
for any finite simple non-abelian group G. This statement is the well-known Ore Conjecture, 
originally posed in 1951 and proved by Ore himself for the alternating groups [Or] . During 
the years, this conjecture was proved for various families of finite simple groups (see [LOSTJ 
and the references therein). Thompson |Th] established it for the groups PSL(n,g), later 
Ellers and Gordeev [EGJ proved the conjecture for all finite simple groups of Lie type defined 
over a field with more than 8 elements, and recently the proof was completed for all finite 
simple groups in a celebrated work of Liebeck, O'Brien, Shalev and Tiep [LOSTJ. 

There was also an interest in quasisimple groups. By [Th] and [LOST] , in every quasisimple 
classical group SL(n, q), SU(n, q), Sp(n, q), fi ± (n, q), every element is a commutator (a 
quasisimple group G is a perfect group such that G/Z{G) is simple). However it is not true 
that every element of every quasisimple group is a commutator, see the examples in [Bl] . 

1.2. Engel words. After considering the commutator word, it is natural to consider the 
Engel words. These words are defined recursively as follows. 

Definition 1.2. The n-th Engel word e n (x,y) £ F 2 is defined recursively by 

ei(x,y) = [x,y] = xyx~ x y~ x , 
e n (x,y) = [e n -i,y], for n > 1. 

For a group G, the corresponding map e n : G x G — > G is called the n-th Engel word map. 

Now, the following conjecture is naturally raised. 

Conjecture 1.3 (Shalev). Let n £ N, then the n-th Engel word map is surjective for any 
finite simple non-abelian group G. 

For some (small) finite simple non-abelian groups this conjecture was verified by O'Brien 
using the MAGMA computer program. 

Note that in order to complete the proof of Ore's Conjecture, Liebeck, O'Brien, Shalev 
and Tiep used the classical criterion dating back to Frobenius, characterizing the possibility 
of writing an element g in a finite group G as a commutator by the non-vanishing of the 
character sum 



(see [LOSTJ and the references therein). Unfortunately, it is unknown whether there is an 
analogous criterion for the possibility of writing an element as an Engel word e n , n > 1. 
Hence, Shalev's Conjecture seems to be substantially more difficult than Ore's Conjecture, 
even for certain families of finite simple groups, such as PSL(2, q). 
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1.3. Engel words in PSL(2,g) and SL(2,g). We consider Engel words in the particular 
case of the groups PSL(2,g) and SL(2,g), in an attempt to prove Conjecture 11.31 for the 
group PSL(2,g). 

By Thompson [Thj . every element of SL(n,g), except when (n, q) = (2, 2), (2, 3), is a 
commutator (including the central elements). Moreover, Blau [Bl] proved that with a few 
specified exceptions, every central element of a finite quasisimple group is a commutator. In 
particular, if G is a quasisimple group of simply connected Lie type, then every element of 
Z(G) is a commutator. Interestingly, such a result fails to hold for Engel words. 

Indeed, in the group SL(2,g), where q is odd, if n > n (g) is large enough then the 
central element —id cannot be written as an n-th Engel word, namely e n (x,y) ^ —id for 
any x, y G SL(2,g) (see Proposition 14. 7p . implying that the n-th Engel word map is not 
surjective. This leads us to introduce the following notion of "almost surjectivity". 

Definition 1.4. A word map w : SL(2,g) d — > SL(2,g) is almost surjective if u>(SL(2,g)) = 
SL(2,g)\{-zd}. 

A method for investigating verbal dynamical systems in the group SL(2, q), using the so- 
called trace map, was introduced in |BGK] . We use this method to study the dynamics of 
the trace map instead of solving equations in groups. There is a special property of the Engel 
word e n (x, y) which makes the dynamics of the trace map particularly amenable to analysis: 
for a group G the morphism G 2 — > G 2 defined as (x, y) i— > (e n (x,y),y) is not dominant. 
Using this method we obtain the following result. 

Theorem A. Let n G N, then the n-th Engel word map is almost surjective for the group 
SL(2, q) provided that q > qo(n) is sufficiently large. 

We moreover give an estimate for qo(n) which, unfortunately, is exponential in n (see 
Corollary ES]). 

Theorem IA1 certainly fails to hold for all groups SL(2,g). Indeed, we give examples for 
integers n > 3 and finite fields F q for which the n-th Engel word map is not almost surjective 
for SL(2,g) (see Example I4.ip . We moreover show that there is an infinite family of finite 
fields F q , such that if n > n$(q) is large enough, then the n-th Engel word map in not almost 
surjective on SL(2,g) (see Proposition I4.9p . 

Considering the group PSL(2,q r ), we see that Theorem IA1 immediately implies that the 
n-th Engel word map is surjective for the group PSL(2,g) provided that q > qo(n). Thus, 
when n is small, one can verify by computer that the n-th Engel word map is surjective for 
the remaining groups PSL(2, q) with q < qo(n), hence for all the groups PSL(2, q). 

Corollary B. Let n < 4 then the n-th Engel word map is surjective for all groups PSL(2, q). 

We have moreover shown that there are certain infinite families of finite fields F q for which 
the n-th Engel word map in PSL(2, q) is always surjective for every n G N. The first family 
consists of all finite fields of characteristic 2 (see Proposition I4.10j) . and the second family 
contains infinitely many finite fields of odd characteristic (see Proposition I4.11j ). Following 
Conjecture [L3] we believe that the surjectivity should in fact hold for all groups PSL(2,g). 

1.4. Equidistribution and measure preservation. Another interesting question is the 
distribution of a word map. For a word w = w(xi, . . . , xj) G F d , a finite group G and some 
g G G, we define 

N w(g) = {(gi, ...,g d )^G d : w(g h ...,g d ) = g}. 
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It is therefore interesting to estimate the size of N w (g), and especially to see whether w is 
almost equidistributed, namely if \N w (g)\ w IG^ 1 for almost all g G G. More precisely, we 
define: 

Definition 1.5. A word map w : G d — > G is almost equidistributed for a family of finite 
groups 3f if any group G G contains a subset S 1 = C G with the following properties: 

(i) |5| = |G|(l-o(l)); 

(«; |A^(#)| = \G\ d -\l + o(l)) uniformly for all g e S; 
where o(l) denotes a real number depending only on G which tends to zero as \G\ — > oo. 

An important consequence (see |GS[ §3]) is that any "almost equidistributed" word map 
is also "almost measure preserving", that is: 

Definition 1.6. A word map w : G d — > G is almost measure preserving for a family of finite 
groups £f if every group G G W satisfies the following: 

(i) For every subset FCGwe have 

\w- 1 (Y)\/\G\ d =\Y\/\G\+o(l); 

(ii) For every subset X C G d we have 

KX)|/|G|>|X|/|G| d -o(l); 

(in) In particular, if X C G d and |X|/|G| d = l — o(l), then almost every element g G G 
can be written as g = w(gi, . . . , go) where g\, . . . , ga G X; 

where o(l) denotes a real number depending only on G which tends to zero as |G| — > oo. 

The following question was raised by Shalev. 

Question 1.7 (Shalev). [Sh071 Problem 2.10]. Which words w induce almost measure 
preserving word maps w : G d — » G on finite simple groups Gl 

It was proved in |GS] that the commutator word w = [x, y] G F2 as well as the words 
w = [x\, . . . ,Xd\ G Fd, d-fold commutators in any arrangement of brackets, are almost 
equidistributed, and hence also almost measure preserving, for the family of finite simple 
non-abelian groups. 

A natural question, suggested by Shalev, is whether this remains true also for the Engel 
words. We prove that this is indeed true for the family of groups PSL(2, q), where q is odd. 

Theorem C. Let n G N, then the n-th Engel word map is almost equidistributed, and hence 
also almost measure preserving, for the family of groups {PSL(2, q) : q is odd}. 

Since it is well-known that almost all pairs of elements in PSL(2,g) are generating pairs 
(see |KL] ). we deduce that for any n G N, the probability that a randomly chosen element 
g G PSL(2, q), where q is odd, can be written as an Engel word e n (x, y) where x, y generate 
PSL(2, q), tends to 1 as q — > 00. 

It was proved in |MWj that when q > 13 is odd, every nontrivial element of PSL(2,g) is 
a commutator of a generating pair. One can therefore ask if a similar result also holds for 
the Engel words. 
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1.5. Notation and layout. Throughout the paper we use the following notation: 

• G = PSL(2,g); 

• G = SL(2,g); 

• F q - the algebraic closure of the finite field F q ; 

• \M\ - number of points in a set M; 

• ___ x - k- dimensional affine space with coordinates Xi, ...,Xk] 

• p(s, u, t) = s 2 + t 2 + u 2 - sut - 2; 

• d(X) - degree of a projective set X; 

• g(X) - geometric genus of a projective curve X; 

• stands for n th iteration of a morphism /. 

Some words on the layout of this paper. In Section [2] we recall the general method 
developed in [BGKJ for investigating verbal systems in the group SL(2,g). We apply this 
method to Engel words in Section [3j In Section H] we discuss the surjectivity (and non- 
surjectivity) of Engel words in the groups SL(2, q) and PSL(2, q) for certain families of finite 
fields. The proof of our main theorem, Theorem appears in Section [51 In Section [6] we 
check the surjectivity of short Engel words for all groups PSL(2,g) and prove Corollary IB1 
The proof of the equidistribution theorem, Theorem O, appears in Section In Section [S] 
we discuss further questions and conjectures. 

2. The trace map 

The main idea is to use the method which was introduced in [BGKJ to investigate ver- 
bal dynamical systems. This method is based on the following classical Theorem (see, for 
example, [Vo[ IFr] IFK] or [Mat IGoj for a more modern exposition). 

Theorem 2.1 (Trace map). Let F = (x,y) denote the free group on two generators. Let us 
embed F into SL(2,Z) and denote by tr the trace character. If w is an arbitrary element of 
F, then the character of w can be expressed as a polynomial 

tr(tu) = P(s, u, t) 

with integer coefficients in the three characters s = tr(x),u = tr(xy) and t = tr(y). 

Note that the same remains true for the group G = SL(2, q). The general case, SL(2, R), 
where R is a commutative ring, can be found in [CMSj . 

The construction used below is described in detail in [BGKj . In this construction, SL(2, F q ) 

is considered as an affine variety, which we shall denote by G as well, since no confusion may 
arise. We will also consider SL(2,F ? ) as a special fiber at q of a Z-scheme SL(2,Z). 

For any x,y G G denote s = tr(x), t = tr(y) and u = ti(xy), and define a morphism 
tt: G xG^A^ by 

ir(x,y) := (s,u,t). 

Theorem 2.2. [BGK[ Theorem 3.4]. For every F q -rational point Q = (s ,u ,t ) G A^ ui; 
the fiber H = vr _1 (Q) has an F q -rational point. 

Let u(x, y) be a word in two variables and let (p : G x G — > G be a morphism, defined as 
<f(x,y) = w(x,y). 

The Trace map Theorem implies that there exists a morphism ip : A^ u t — > A^ u t such that 

^(vr(x, y)) = Tr(0(x, y),y). (2.1) 
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This map is called the "trace map" , and it satisfies 

il>(s,u,t) : = (A( S ,M),/ 2 ( S ,M),t), (2.2) 
where f x (s,u,t) = tx(0(x,y)) and J 2 (s 1 u,t) = ti(ip(x,y)y). 

Define ip = (<p, id) : G x G ^ G x G by ip(x,y) = (ip>(x,y),y). Then, according to ( 12. ip 
and (12. 2p . the following diagram commutes: 

G x G — ^ GxG 

(2.3) 

Therefore, the main idea is to study the properties of the morphism ip instead of the 
corresponding word map u. 

As will be shown later, the morphism ip corresponding to Engel words is much simpler. 
Moreover, it follows from Theorem 12.21 that the surjectivity of implies the surjectivity of 
(p (see Proposition 13.61) . 

3. Trace maps of Engel words 

Let e n = e n (x, y) : G x G — > G be the n-th Engel word map, and let s n = tr(e n (x, y)). 
Then 

s\ = tr(ei(a;, y)) = tr([x, y\) = s 2 + t 2 + u 2 — ust — 2 = p(s, u, t). (3.1) 
Moreover, for n > 1, 

tr(e„(x, y)y) = tx{e n ^ 1 ye~\y~ x y) = ti{e n ^ye~\) = tr(y) = t. (3.2) 
Therefore, for n > 1, 

s n+ i = tr(e n+ i) = p(s n , t, t) = s 2 n - s n t 2 + 2t 2 - 2. (3.3) 
In the notation of diagram (12. 3|) we have 

i/j(s, it, t) = (p(s, u, t),t, t). (3.4) 
This yields a corresponding map ip n+ \ : A z sut — > A^ ut , which satisfies 

ip n +i(s, u, t) = ^ (n+1) (s, u, t) = if)(s n , u, t) = (p(s n , t, t),t, t) = (s n+1 ,t, t). (3.5) 

Remark 3.1. If n > 1 and tr(y) ^ then e n (x,y) ^ —id, since tr((— id)y) = — tr(y) 7^ tr(y) 
in contradiction to (13.21) . 

Define H = {(x,y) G GxG\ tr(xy) = tr(y)} and A = {(s, u, t) e A 3 s ^ t \ u = t} = A 2 1 , then 
7r(H) C A. Equation (I3.5P now shows that in order to find the image of ip n : A^ u t — > A]j u t , 
one may consider its restriction (U" : A 2 t — > A 2 t , where fi(s,t) = (s 2 — st 2 + 2t 2 — 2,t). 

Definition 3.2. Let us introduce the following morphisms: 

• ip n :G xG -HS x G, <p n (x,y) = (e n+ x(x,y),y), (p n (x,y) = ip^ +1 \x,y); 

• 9 : Gx G -> G, 6(x,y) = x; 

• r : G -> A], t(x) = tr(x); 

• Ai : Al t AJ, Ai(s,t) = s; 

• ^2 : Aj W)t -> A 2 Sjt , X 2 (s, it, i) = (s, t); 

• /i : A 2 Sit -> A 2 t , /x(s, t) = (s 2 - st 2 + 2t 2 - 2, t); 
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• Hn = /i (n) ; 

• p n : A 2 t -> A\, p n = Ai o /i n ; 

These morphisms determine the following commutative diagram: 



(n) 

~ ~ <P0 <?0 

G x G *H 



A 3 



u,t 




G 



(3.6) 



Remark 3.3. 9 o ip n (x,y) = e n+1 (x,y) and Vvh( s ,M) = (p n o X 2 (^(s,u,t)),t,t). 

Equation ( 13. 4 p shows that the morphism G 2 — > G 2 defined as (x,y) h-> (e n (x,y),y) is not 
dominant, since the trace map ip of the first Engel word ei(x,y) = [x,y] maps the three- 
dimensional affine space A 3 into a plane A = {u = t}. One can consider the trace maps of 
the following Engel words e n+ i as the compositions of this map ip with the endomorphism 
p n of A. 

First, in Proposition I3.4[ we find the image ?/>(A 3 ) C A and then in Proposition 13.61 we 
establish the connection between the image of p n and the range of the corresponding Engel 
word e n+ i. In the next section we shall study the properties of p n . 

Proposition 3.4. The image ^ q = ?/>(A 3 u t (F g )) is equal to: 

(1) A(¥ q ), if q is even; 

(2) A(¥ q ) \Z q C A{¥ q ), ifq is odd, where 

Z q = {(s, t, t) G A\ t 2 = 4 and s — 2 is not a square in ¥ q }. 
Proof. A point (s,t,t) G ^ q if C Sjt (¥ q ) ^ 0, where 

C S)t = {(s',u,t)\ p(s',u,t) = s}. 

Now, 

p{s\ u,t)-s = s' 2 + u 2 + t 2 - us't - 2 - s. 
Case 1. q is even. Then the equation 

p(s', u,t) — s = s' 2 + u 2 + t 2 — us't — 2 — s = 

has an obvious solution s' = 0, u 2 = t 2 + s, since every number in ¥ q is a square. 
Case 2. q > 3 is odd. Then 



p(s', u,t)-s = s' 2 + u 2 + t 2 - us't - 2 - s = (s' - y ) 2 - it 2 (~ — 4~~~) + ~ 2 



s. 



Thus, C Sj t for a fixed t, is a smooth conic if t 2 — 2 — s ^ and t 2 7^ 4, with at most two 
points at infinity. If t 2 — 2 — s = then C Sj i is a union of two lines 



ut. 



{(s' - y) - = 0} U {(s> - y) + yW^l = 0} 

which have a point (V = 0, u = 0) defined over any field, provided t 2 — 4 ^ 0. 
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If t 2 — 4 = 0, then the equation 

p(s', u, t) - s = (s' - y ) 2 + 2 - s = 
has a solution if and only if s — 2 is a perfect square. □ 

Definition 3.5. Let us define the following sets: 

• E n+ i = 9 o (p n {G x G) = {z G G : there exist (x,y) G G x G s.t. e n+ i(x, y) = z}; 
. Y 9 = A 2 (* 9 ); 

• r; = A 2 (vI/ g )\{( s ,t):t = 0}; 
. T n (F g ) = p n (F g ); 

. T^(F 9 ) = Pn (y;). 

Proposition 3.6. 

(A) If q > 2 is even and a G F 9 , then the following two statements are equivalent: 

(i) a G T n (F q ) = p n (\2(A(¥ q )))- 

iii) Any element z G G with tr(z) = a belongs to E n+ i. 

(B) If q > 3 is odd and a G F^, a ^ —2, then the following two statements are equivalent: 

(i) a G T n (¥ g ) = ^(AaC*,)); 

(ii) Am/ element z G G wift tr(z) = a belongs to E n+ \. 

(C) If q > 3 is odd and —2 G T^(F 9 ) i/ien ever?/ element z E G, z ^ —id, with tr(z) = —2 
belongs to E n+1 . 

Proof. It is obvious that if z = e n +i(x,y), then a = tv(z) = p n o \2(ip(ti(x), tr(xy), tr(y)). 
Thus we need to prove the implications (z) =>■ (ii). 

Assume that a = p n (s,t) for some (s, t) E Y q = A2 (^g) - Since if) is surjective onto ^g, 
there exists a point (s',u,t) G A 3 (F g ) such that (s,t,t) = i/;(s',u,t). Since the morphism 7r 
is surjective for any field, one can find (x',y r ) G G x G such that ir(x',y') = (s',u,t). Let 
f = e n+ i(V, ?/'), then tr(t>) = a (see digram (13.61) ). 

Case 1. Either q is even and a^O, or q is odd and a 7^ ±2. 

In this case, a = tr(z) = tr(u) implies that v is conjugate to z, i.e. z = gvg~ x for some g G 
G. Therefore e n+ i(gx' g^ 1 , gy'g^ 1 ) = gvg^ 1 = z, and so one can take x = gx'g~ l ,y = gy'g^ 1 . 

Case 2. Either q is even and a = 0, or q is odd and a = 2. 

Observe that 2 always belongs to T n (F q ) since 2 — 2 = is a perfect square and (2, t) is a 
fixed point of \i n . 

(\ c\ 

It suffices to prove that all matrices w = I ^ , c G F g , are in the image E n . Since 

1 b\ (a 0\\ _ A 6(1 - a 2 )™ 

v'V° i;; v° 1 

one can take some / a G F, with a 2 7^ 1 and b = jjz^yi ■ 
Case 3. q is odd and a = —2. 

If -2 G T^(Fq) then v 7^ -id by Remark [3J] Choose a G F ?2 \ F g such that a 2 G ¥ q . Let 



\^ a J 

Then mvmT 1 G G and moreover, either -y or mvm~ x is conjugate to z in G. 



m = I 1 
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If v is conjugate to z, then we proceed as in Case 1. 

If mvm" 1 is conjugate to z, then we consider the pair (x" = mx'm" 1 ,y" = my'm" 1 ) E 
G x G. We have mvm" 1 = e n+ i(x",y"), and we may continue as in Case 1. □ 

Corollary 3.7. IfaE ¥ q , a ^ -2 belongs to the image p n (Ag t )(W g ), then any element z E G 
with tx{z) = a belongs to E n . 

Proof. Indeed, p n (A 2 t ) C p n _i(X 2 (^f g )) , because ip(s,t,t) = (p(s,t),t,t). □ 

Definition 3.8. When q is odd, the point (s,t) E A 2 st is called an exceptional point if either 
t = or t 2 = 4. The set of all exceptional points is denoted by T. 

Corollary 3.9. If either q is even and a E p n (A 2 t ), or q is odd and a E p n (A 2 t \ T), then 

any element z E G = G(¥ q ) with tr(z) = a belongs to E n+ i, i.e there exist (x,y) E G x G 
such that z = e n +i(x,y). 

Corollary 3.10. If q is odd and T n (¥ q ) contains either a or —a for every a E ¥ q then the 
Engel word map e n +i is surjective on PSL(2,g). 

Proof. It follows from Proposition 13.6( B) and the fact that both elements z E SL(2,g) and 
— z E SL(2, q) represent the same element of PSL(2, q). □ 

4. SURJECTIVITY AND NON-SURJECTIVITY OF ENGEL WORDS OVER SPECIAL FIELDS 

The following examples show that the n-th Engel word map (for n > 3) is not always 
almost surjective on SL(2, q) (in light of Proposition 13. 6p . However, it is still conjectured 
that it is surjective on PSL(2,g) (see Conjecture II. 3p . 

Example 4.1. In the following cases, computer experiments using Magma show that there 
is no solution to p n = a in ¥ q . 

• There is no solution in Fn to p n = 9 for every n > 2. 

• There is no solution in F13 to p n = 4 for every n > 5. 

• There is no solution in F17 to p n = 10 for every n > 2, to p n = 4 for every n > 4, 
and to p n = 5 for every n > 5. 

• There is no solution in F23 to p n = 16 for every n > 2. 

• There is no solution in F 53 to p n = 31 for every n > 8. 

• There is no solution in F 6 7 to p n = 4 for every n > 10. 

Remark 4.2. In fact, it is sufficient to check any of the above examples for all integers 
n < q, since for every (s,t) E ¥ 2 there exist some N < q such that p^(s,t) is a periodic 
point of p. 

Following some further extensive computer experiments using Magma, in which we checked 
all q < 600 and n < 50, we moreover suggest these conjectures (see also Proposition 14.91 be- 
low). 

Conjecture 4.3. For every finite field ¥ q , a E ¥ q and n E N, unless either a = 1 and 
a/2 ^ ¥ q , or the triple (q,a,n) appears in one of the cases in Example \4- 1\ one has that p n 
attains the value a. 

Conjecture 4.4. For every finite field ¥ q , a E¥ q and n E N, either a or —a is in the image 
of pn- 
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Observe that if the first conjecture is true then so is the second. 

We continue by considering some special infinite families of finite fields. We will mainly 
use the following properties of the maps p n and p n . 

Properties 4.5. 

(1) p(l,t) = (t 2 -l,t); 

(2) ^(t 2 - m) = (i 2 - 

(3) /i(2,t) = (2,t); 

(4) p(t 2 -2,t) = (2,t); 

(5) p n (s,0) = x 2n + if s = x+±; 

(6) p n {s,t) = {s-l) 2n + l, ift 2 = 2; 

(7) p n (s,t) = (s-2f l +2, ift 2 = L 

Corollary 4.6. Let t G ¥ q . Then t 2 — 1 is in T n (¥ q ) for every n. 

Proof. Item ([2]) implies that the point (t 2 — l,t) is a fixed point of p. Moreover, if t 2 = 4, 
then (t 2 — 1) — 2 = 1 is always a square, and hence t 2 — 1 G ^ q for every q. □ 

We shall now explain why —id cannot appear in the image of long enough Engel words, 
motivating Definition II .41 of "almost surjectivity". 

Proposition 4.7. If n > 1 and q > 7 is an odd prime power, then there is a solution 
(x, y) G G 2 to the equation e n+ i (x, y) = —id if and only if there exists some c G ¥ g 2 satisfying 
c z = -1. 



Proof. Assume that e n+ i(x,y) = —id. Then, by Remark 13. 1[ there exists some b G ¥ q such 



that p n (b, 0) = —2. According to Properties 14.51 (15]) . 

Pn (b,0) =C 2 " + -^, 

where c G ¥ q 2 is defined by the equation b = c + -. Thus, 



implying that 



on 1 _ 

c 2 + — = -2, 
c z 



(c 2 "" 1 + ^r) 2 = 0, 



c 

and so 

c 2 " = -1. 

On the other direction, assume that there exists some c G ¥ q 2 satisfying c 2 ™ = —1, let 
b = c + and denote 



.4 



c 
i 



Consider the rational curve C defined by s 2 + w 2 = 6 + 2. Note that 6 + 2^0 since -1. 
Thus, being a smooth rational curve, C(¥ q ) has at least q — 1 points. If g > 7, there are 
points in C(F g ) such that s ^ ±2. Let (s,w) be such a point, and let x ,y G SL(2,g) 

be any pair of matrices such that tr(xo) = s,tr(xo2/o) — u ,^ T (yo) — 0. 
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We shall show that e n+ i(a;o, Vo) = — id. Consider xq and yo as elements of G\ = SL(2, Fx), 
where Fi is a quadratic extension of W q such that c G Fx. Let 7Ti : G\ — > A 3 (Fx) be the trace 
projection: 

ni(x,y) = (tr(x),tv(xy),tv(y)). 
Then any pair (xi, satisfying that 7i"i(:Ei, yi) = (s, w, 0) is conjugate to the pair (x , Vo) in 
Gi, namely, there exists g E Gx such that xi = gx$g~ l , yx = <7yo<7 -1 - 

Hence, e n+1 (x ,y ) is conjugate in Gx to e n+1 (xi, y x ). 

Take 

'0 f 
-1 



sc uc 



xx = I C +J c t x J , yi 



c+1 c+1 

A direct computation shows that 
Let us now compute e n (A, yi). Let 




X(a) = 

Then 
and so, 

e n (X(a),yx) 



a 
i 



a 2 



4 



a 2 " 

4r 



Since A = X(c), then e n (A, yi) = — zcf. In addition, A = ex(xx,yx), and hence e n+ x(xx,yx) = 
—id. But then e n+ i(xo, yo) is conjugate to —id, and therefore e n+ i(xo?yo) = —id as well. □ 

Remark 4.8. If n > q then the equation c 2 " = —1 has no solution in ¥ q 2, and hence 
e n (x, y) 7^ —id for every ij6 SL(2, g). 

However, —id can be written as a commutator of two matrices in SL(2, q), where q is odd. 
Indeed, take a, b G ¥ q satisfying a 2 + b 2 = — 1, then 

l\ /a b 
-1 OJ'U -a 



—id. 



(See |Th] and [Blj for a general result regarding central elements in SL(n, q) and other 
quasisimple groups). 

We moreover show that there exists an infinite family of finite fields W q , for which the ra-th 
Engel word map in SL(2, g) is not even almost surjective for sufficiently large n > no(q). 

Proposition 4.9. Let ¥ q be a finite field which does not contain y2. Then there exists some 
integer n = n (q) such that for every n > n , p n ^ 1. 

Proof. Since the set of points (s,t) G A 2 (F g ) is finite, every point is either periodic or 
preperiodic for // n . This means that for every (s,t) G A 3 (F g ) there are numbers n(s,t) and 
m(s,t) < n(s,t) such that 

l^n( s ,t)(s,t) = fi m (s,t)(s,t). 
For a point (s,t) we define n(s,t) as the minimum of all possible n(s,t). 
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Let 

n = max{n(s,t) : (s,t) e A 2 (¥ q )}. 

Then every (s,t) G R no = p no (A 2 (¥ q )) is periodic and R n = R no for any n > n . In order to 
show that p n 7^ 1 it is sufficient to show that for any t, the point (1, t) G" R no , i.e. to show 
that it is not periodic. Indeed, p(l, t) = (t 2 — 1, t) which is a fixed point for any t. Thus, for 
every k > we have /z fc (l, t) = (t 2 - 1, i) ^ (1, t), if t 2 ^ 2. □ 

On the other hand, we show that there are certain infinite families of finite fields ¥ q for 
which the n-th Engel word map in PSL(2, q) is always surjective for every n G N. The first 
family consists of all finite fields of characteristic 2, and the second family contains infinitely 
many finite fields of odd characteristic. 

Proposition 4.10. For every n > 1, the Engel word map e n is surjective on the group 
PSL(2,g) for q = T, e > 1. 

Proof. In this case 

/i(s, t) = (s 2 - st 2 , t), p(s, 0) = s 2 . (4.1) 

Thus p(s, 0) is an isomorphism of Aj(F ? ), as well as any of its iterations p n (s, 0). According 
to Proposition 13. 6[ this implies the surjectivity of the n-th Engel word map on PSL(2, q) = 
SL(2,g). ' □ 

Proposition 4.11. For every n > 1, the Engel word map e n+ \ is surjective on the group 
PSL(2, q), if a/2 G ¥ q and a/^T £ F ff . 

Proof. By Corollary I3.10[ we need to show that either a G T n (¥ q ) or —a G T n (¥ q ) for every 
a G ¥ q . 

In this case, the map x — > x 2 is a bijection on the subset of perfect squares of ¥ q . It follows 
that if a = b 2 for some b G F 9 , then for every n, there is some b n G ¥ q such that a = b 2 ™ . 
Moreover, for every a G ¥ q either a = b 2 for some b G ¥ q or a = —b 2 for some b G ¥ q . 

Assume that z G PSL(2,g) and z / e n+ i(x,y). Let tr(^) = a. Then, by Corollary I4.6[ 
neither a + 1 nor —a + 1 is a square in ¥ q . It follows that a + 1 = — c 2 and —a + 1 = — b 2 for 
some 6, c G F g . Hence, a = b 2 + 1 = b 2 ™ + 1 = p„(&„ + 1, \/2) according to Properties I4.5p)j ). 
yielding a G T n (¥ q ). □ 

5. Engel words in SL(2, q) for sufficiently large q 

In this section we prove Theorem |A] and show that the n-th Engel word map e n is almost 
surjective on SL(2, q) if q > qo(n) is sufficiently large. We moreover give an explicit estimate 
for qo{n), which, unfortunately, is exponential in n. 

Our proof has a geometrical flavor. Let us briefly describe it and explain the geometric 
idea behind our calculations. Consider the diagram (13. 6p . Instead of solving the equation 
tr(e n+ i(a;, y)) = a, we look for points defined over a ground field ¥ q in the curve {p n (s, t) = 
a}. This is an affine curve. In order to use the Weil inequality, we have to know that it has 
an irreducible component defined over the ground field ¥ q , and we need to estimate its genus 
and the number of punctures. 

To this end we represent the curve as a tower of double covers of a rational curve (see 
equation ( 15. 2p ). The geometrical interpretation of this procedure is an embedding of the 
curve into an affine space of a higher dimension A™ +2 . Zn K t- Then we consider the closure X 
of this curve in the corresponding projective space P"^t 2 . Xn . y:i i-w 
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It appears (see Lemma 15.21) that the intersection of X with the hyperplane at infinity 
consists of smooth points defined over F g for any q. Thus every irreducible component of X 
is defined over F 9 as well. Indeed, assume that an irreducible component (say, Xi) is defined 
over an extension of W q and is not invariant under the action of the corresponding Galois 
group T, then the T— invariant points would belong to the intersection Xi n 7pQ),7 G T, 
and therefore, the points defined over ¥ q would not be smooth. 

The rest of the proof deals with the estimation of the genus and the number of punctures. 

By Proposition 14.101 we may assume that q is odd. We continue to use the notation 
introduced in Definition 13.21 

Theorem 5.1. For every n G N there exists qo = qo{n) such that p n : A 2 t \ T — > A] is 

surjective for every field ¥ q with q > qo- Moreover, if n is a prime then there is an orbit of 
H of length precisely n. 



Proof. Together with the endomorphism \i : A 2 t — > 



morphism m : 



v 2 



\g t we may define the following endo- 



by 



m(z, x 



[z(z 



XL X 



(5.1) 



A direct computation shows that // may be reduced to (15.1 1) by the substitution z = 
s - 2, x = t 2 - 4. 

Similarly to the morphisms Ai(s, t) = s and p n — Ai o jj,( n \ we may define the morphisms 
I : & 2 — > Al, l(z, x) = z and r n = I o m^ n \ 

First, we note that s = 2 is always in the image of p n (see Proposition I3.6p . Note also that 
(s,t) = (—2,0) cannot be a periodic point, since /i(— 2,0) = (2,0), which is a fixed point. 

Now, assume that some a + 2 G ¥ q , a ^ is in the image of p n . This is equivalent to 
a = r n (z, x) for some z E ¥ q and x = t 2 — 4, t G W q . The last statement implies that the 
following system of equations has a solution in W q : 



' z 2 = zi(zt - x), 



Zn— 1 \Z"n— 1 
ZrX^n X 1 ), 



X 



(5.2) 



x = r 



4. 



Similarly, the orbit of length n is defined by the following system: 

z 2 = zi(z t - x), 



Z-n 


Zn—\\Zn 


-l — 


Zl = 


Zji \ Z j t 


x), 


X = 


t 2 -4. 





(5.3) 



If n is a prime, then system (15. 3p describes all the points in an orbit either of exact length 
n or of exact length 1. In the latter case, these points are Zi = x+ l,i = 1, . . . ,n and 

Zi 



0,i 



l,...,n. 
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Consider the projective space P n+2 (F g ) with homogeneous coordinates {xi : ■ ■ ■ : x n : y : 
d : w}. Assume that z% = £i , i = 1 . . . , n, x = ^, t = Then system (15. 2p defines in P™+ 2 
a projective set 



' x 2 w = xx(xi - y), 



X = < Xn w = x n -i(x n ^i - y), 
aw 2 = x n (x n - y), 
yw = d 2 — Aw 2 . 



(5.4) 



Similarly, system (15.31) defines a projective set 



' x 2 w = xx(xi - y), 

< x n w = x n _i(x„_i - y), 
Xiw = x n (x n - y), 
yw = d 2 — Aw 2 . 



(5.5) 



Lemma 5.2. The intersections S = X fl {w = 0} and S± — Xi n {w — 0} consist of 1 n 
smooth points with w = 0,d = 0,y = 1 and X{ = or 1 (for i — 1, . . . , n). 



Proof. If there was a point in X with w = 0,y = 0, then, according to (15.41) (respectively 
(15. 5p ). d and all Xj would vanish as well, which is impossible. Thus, y ^ at the points of 
5* and Si. But then (15 .4p (respectively (I5.5P ) implies that every Xi is either or y at the 
points of 5* (respectively Si). 

It follows, in particular, that the sets X and X\ have no components of dimension greater 
than 1, since the intersection of each such component with {w = 0} would be positive 
dimensional. 

Let us compute the Jacobian matrices of these systems. We have for (15.41) : 



0„ 



a, 



d 



Or 



d d 



— x 2 —Xi 2xi — y —w 
— x 3 — x 2 2x 2 — y 



-w 








-x n -x n _i 
—law —x n 
2d -8w-v -w 



. 2x n -y -w 
2x n - y 




and similarly for (15. 5p : 
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O w 


°u 


R 

C X1 


r) 




Xn 







-x 2 


—X\ 


2x 1 -y 


—w 













-x 3 







2x 2 - y 


— w 





(5. 





Xn 


Xn—l 








... 2x n -y 


— w 







-Xi 


Xn 


— w 








2x n - y 




2d 


—8w — y 


—w 

















Since at the points of S and Si the ranks of these matrices are n + 1, every point is 
smooth. □ 

Remark 5.3. In particular, we have proved that the map p n is surjective over every alge- 
braically closed field. Indeed, every component of X has dimension at least one, thus no 
fiber is contained in the set {w = 0}. 

Consider an irreducible component Ai (over ¥ q ) of X of degree di. If it was not defined 
over F q , then every point in Ai, which is rational over ¥ q , would be singular. But, according 
to Lemma [5.21 Ai has smooth points defined over ¥ q (namely, A, fl S). Thus, Ai is defined 
over Fq. Similarly, every irreducible component Bi of Xi is defined over W q . 

Let uj : P n + 2 — )• dw be defined as U)(x\ : • • • : x n : y : d : w) — (x± : d : w). Then u 
induces a birational map of every A^ (respectively Bi) on its image Ri = oj(Ai) (respectively 

Ui = co(Bi)), because of (15 .4p (respectively (15. 5p ). Thus, Ai is birational to the closure Y 
in P 2 of an irreducible component of the set 

Y^ = {r n ( Zl ,t 2 -4) = a}G A 2 Zut , 

which becomes 

Y^ = { Pn (s,t) = a + 2} C Al t , 
after the following change of a coordinates s — 2 + z±, {x\ — > x± + 2w). Similarly, Bi is 
birational to the closure in P 2 of an irreducible component of the set 

Z^ = {p n (s,t) = s}cAl t . 

The plane curves Ri and Ui are defined over the ground field as the projections of Ai and 
Bi respectively. Let d n < 3 2 " and j n < 3 2 " be the degrees of Y n and Z"' respectively. 

For the number N(q) of points over the field ¥ q in an irreducible curve C of degree d in 
P 2 , we use the following analogue of the Weil inequality (see |APj . |GL[ Corollary 7.4] and 
[LYl Corollary 2]), 



Hence, we obtain 



and 



\C(¥ q )-(q + l)\<(d-l)(d-2) y /d. 
\Ri(¥ q )\ > q + l-d 2 n ^, 
\Ui(¥ q )\>q + l-j 2 n ^. 



(5.8) 
(5.9) 
(5.10) 



Now we need to check how many of these points can be exceptional or at infinity. All 
these points are the intersection points with 4 lines: d — 0, d — ±2w, w = 0. By the Bezout's 
Theorem there are at most 4d n (respectively, 4j n ) such points. 
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For any q > 2d^ we have 

q + 1 - d 2 n ^q~ > 2c£ + 1 - d*V2 = d 4 n (2 - ^2) + 1 > 4d n . 
Similarly, for q > 2j*, 

q + 1 - JnVQ > 2 Jn + 1 - Jn^ = J*(2 - ^2) + 1 > 4 Jn . 

Thus, if q > max{2<, 2j*} then (Ri \ T)(F g ) ^ and ([/; \ T)(F ff ) ^ 0, which completes 
the proof of Theorem 15.11 □ 

Corollary 5.4. The map e n : G x G ^ G is almost surjective if G = SL(2, q) and q > qo(n) 
is big enough. 

Proof. According to Corollary 13. 9[ the almost surjectivity of e n+ i on SL(2, q) follows from 
the surjectivity of p n on A^\ T, which was proven in Theorem 15. II for any q > qo{n). □ 

In order to make the estimation for qo(n) more precise a detailed study of system (15. 4p is 
needed. 

Proposition 5.5. The curve X defined in (15 ,4p is irreducible provided a^O. Let v : X — > X 

be the normalization of X. Then the genus g(X) < 2 n (n — 1) + 1 and z>~ 1 (5') contains 2 n 
points. 

Proof. We will work over an algebraic closure of a ground field. For k — 1, . . . , n, we denote 
by Ck a curve defined in p n - fc + 2 by 

( x k+1 w = x k (x k - y), 



Ck 



(5.11) 

x n w = x„_i(x n _i - y), 
aw 2 = x n (x n - y). 



Lemma 5.6. If a ^ and q is odd, then the system (15. lip for k = 1 defines in P™ +1 a 
smooth irreducible projective curve C\ of genus g{C\) < 2 n ~ 1 (n — 2) + 1. 

Proof. Let g k denote the genus g(C k ) (if C k is irreducible). 

We shall prove by induction on r = n — k that all curves C k are irreducible and moreover 

g k < 2 n -\n - k - 1) + 1. 

Step 1. It is obvious that C n (r = 0) is an irreducible conic in P 2 and that g n = 0. At a 
point (a : : 1) G C n we may use the affine coordinates = ^, x = ^. A local parameter 
on C n at this point may be taken as z n — a, since 

a — s 

(see, for example, [DS|. I, Chapter 2, §1.6] for a definition of a local parameter). 

The induction step. Assume that for r = n — k the assertion is valid, namely: 

• The curve C k is smooth and irreducible; 

• z k — a k is a local parameter at every point (a k : • ■ ■ : a n : (3 : 1) G C k (w ^ 0); 

• £fc < 2 n " fe (n - jfe - 1) + 1. 



x - /3 = - a) 1 + 
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The curve Ck-i is a double cover of Ck, since to the equations defining Ck one equation 
for the new variable Xk-i is added: 

x k w = x k -i{x k -i - y). 

Thus, 

V , nP' 
x k ~i = - ± Y — + wx k - 



It follows that the double points are 

" 2' ~ K Aw 



y y 2 



Note that w 7^ at a ramification point. Indeed, if it; = and \l ^ + toj = then ?/ = 0, 



which is impossible in light of Lemma 15.21 Thus we may take w — 1. 

•i 



Hence in affine coordinates, at the double point (| : — a n : /3 : 1) £ Ck-i, we 



have 



2 

+ z k is a local parameter on by the induction hypothesis; 

P\2 _ P 2 



• (^fc-i — f) 2 ~~ T + Zk - 
It follows that: 

• This point is a ramification point indeed; 

• Zk-i — | is a local parameter on Ck-i at this point; 

• Ck-\ is smooth at this point. 

Outside the ramification points, the projection Ck-\ —> Ck is Stale. At infinity all the 
points are smooth, see Lemma 15.21 Therefore, since Ck is smooth and irreducible by the 
induction assumption, then Ck-i is smooth and irreducible as well. 

Let us compute the number of ramification points. We have: 

2 2 2 

_ y y i r ( % / x 

x k -i - -, x k - -—, x k+ i - -y), ■■■ . ( 5 12 ) 

x k+ s = Ps(y), •■• ,x n = p n -k(y), a = p n - k +i{y), 

where p s (y) is a polynomial in y and deg p s (y) = 2 S+1 . 
Hence, the last equation has I < 2 n ~ k+2 distinct roots. 

By the Hurwitz formula (see e.g. [DS|. I, Chapter 2, §2.9]) and the induction estimate for 
gk we obtain: 

g k -i = 2g k -l+ l -< 2{2 n -\n - k - 1) + 1) - 1 + 2 n ~ k+l = 

2 n - {k - 1 \n - k) - 2 ■ 2 n ~ k + 2 n ~ k+1 + 2 - 1 = 2 n - {k - l \n - k) + 1. 
This completes the induction. Thus, g\ < 2 n ^ 1 {n — 2) + 1. □ 
Now, the curve X is obtained from C\ by adding one more equation: 

wy = d 2 — 4u> 2 , 

(this is the last equation of system f !5.4p ). It follows that X is a double cover of C\ with 
double points at w = or y = —Aw. At every such point y ^ 0. Moreover, X is smooth at 
every point of S (see Lemma [5.2p . hence every point of 5* is a ramification point. Thus, X 
is irreducible. Moreover, v is one-to-one at these points. 
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Any other double point is either a ramification point or a double self-intersection. Since 
d 2 = wy + Aw 2 , these are points with y = —Aw. Similarly to (15.121) . there can be at most 2 n 
such points at X. 

From the Hurwitz formula we obtain: 

g(X) < 2g{C 1 ) - 1 + 2 n = 2(1 + 2 n ~ 1 {n - 2)) - 1 + 2 n = 2 n {n - 1) + 1. 

This completes the proof of Proposition 15.51 □ 

Remark 5.7. The more detailed analysis of the curve X shows that it is not smooth only 
if a = —A. If a ^ —A the normalization is not needed. 

Corollary 5.8. For any n > 2, the map e n+ \ : GxG — > G is almost surjective ifG = SL(2, q) 
andq>2 2n+3 (n-l) 2 . 

Proof. We want to prove that any number a G ¥ q is attained by r n . Since the normalization 
X of X is defined over the ground field (see [Sal Chapter 1, §6.4 and §7]), every point 
x G X(W q ) provides a point z>(x) G X(¥ q ). In order to exclude the exceptional points, we 
should take away from X the following points: 

• 2 n points of S; 

• 2 n+1 points with y = 0,d = ±2w; 

• 2 n points with y = —Aw, d = 0. 

Since (for a = —2) the points with y = —Aw, d = may be selfintersections, we should count 
them twice. Thus we need that |X(F g )| > 5 • 2 n . 

We shall use the Weil inequality (see [APj ) once more. For a field ¥ q we need that 

q + l-2gy/q-5>0, (5.13) 

where, by Proposition I5.5j g < 2 n (n — 1) + 1, and 5 = 5 ■ 2 n . Take q > 2 2n+3 (n — l) 2 . 
Then 



(5.14) 



q + 1 - 2g^ -5> 2 2n+ \n - l) 2 + 1 - 2(2 n (n - 1) + l)2 n+l (n - 1)^2 - 5 ■ 2 n 
> 2™(2 n+3 (n - l) 2 - 2 n+2 (n - l) 2 V2 - Ay/2(n - 1) - 5) > 0, 

for any n > 2. □ 

6. Short Engel words in PSL(2, q) 

In this section we prove Corollary Q3] and show that for any n < A the n-th Engel word 
map is surjective for all groups PSL(2,g). From Corollary 13.101 it follows that in order to 
prove that the map e n+ i : G x G — > G is surjective, one should check that for every a G ¥ q 
either a or —a belongs to the image T n of p n . For a fixed n and q big enough it follows from 
Theorem 15. 1[ and so for small values of q it may be verified by computer. Indeed, we have 
done the following calculations for small values of n using the Magma computer program. 



Case e\ = [x,y\: In this case, the surjectivity follows from Proposition 13.4^ Proposition 13.61 
and Remark 14.81 This provides an alternative proof to the well-known fact that any element 
in the group SL(2, q) (and in the group PSL(2, q)), when q > 3, is a commutator (see |Th] ) . 
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Case e 2 = [x,y,y): We need to prove that the map pi is surjective. Indeed, the equation 

px (s, t) - a = s 2 - st 2 + 2t 2 - 2 - a = 

defines a smooth curve of genus 1 with two punctures if a 2 ^ 4. Thus if q > 7 it has a point 
over W q . The case a = 2 was dealt with in Proposition 13.61 The cases q = 5, 7 can easily 
be checked by a computer. Therefore, e 2 is surjective on SL(2, q) \ {—id}, and hence on 
PSL(2, q), for any q > 3. 



Case e 3 = [x, y, y, y): Recall that by Example 14. 1\ e% is no longer surjective on SL(2,g). In 
this case, the curve pz(s, t) — a has genus 2 2 + 1 = 5 and it has at most 20 punctures at oo, 
t 2 = 4 and t — 0. Thus the techniques of Section [5] may be applied for any q which satisfies 
that 

q + 1- I0y/q- 20 > 0, 

namely, for any q > 137. For q < 137 the surjectivity on PSL(2,g) was checked by a 
computer. 

Case e 4 = [x, y, y, y, y]: In this case g = 17, and the computations were done for all q < 1240. 

7. Equidistribution OF the Engel words in PSL(2,g) 

In this section we prove Theorem O by showing first that the n-th Engel word map is 
almost equidistributed for the family of groups SL(2, q), where q is odd, and then explaining 
how this implies that the n-th Engel word map is almost equidistributed (and hence also 
almost measure preserving) for the family of groups PSL(2, q), where q is odd. 

More precisely, for g E G = SL(2, q), let 

E n {g) = {(x,y) EG xG : e n (x,y) = g}, 



then by Definition 11.51 we need to prove the following: 

Proposition 7.1. If q is an odd prime power, then the group G = SL(2, q) contains a subset 
S = C G with the following properties: 

(i) |S| = |G|(l-e); 

(ii) |G|(l-e) < \E n (g)\ < |G|(l + e) uniformly for all g e S ; 
where e — > as q — >■ oo. 

For the commutator word e\ = [x,y], Theorem [O has already been proved in \GS\ Propo- 
sition 5.1]. Hence we may assume that n > 1. Following Section [5] we continue to assume 
that q is odd. We maintain the notation of Definition 13.21 



Proof of Proposition 7J_. Consider the following commutative diagram of morphisms: 

k% u ,^Uk 2 (7.1) 






2 




s,t 


Pn / 










2 




s.t 



Here, 7 = 6 o {p n = e n+ i, p'(s, u, t) = (s 2 + t 2 +u 2 — ust — 2, t), and a is a composition of 
the corresponding morphisms in the diagram. 
We denote f~\a) = f- 1 (a)(¥ q ). 
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Let a G ¥ q , a ^ ±2. Then a -1 (a) is a union of the fibers T z = 7~ 1 (;z), where z G G is 
an element with tr(z) = a. Since a 7^ ±2, any T z may be obtained from any other T z > (with 
tr(z') = a) by conjugation, and so l-E^z)] = \E n (z')\. Hence, 

l7-WI = ™. (-) 

Recall that | SL(2, g)| = q 3 - q. 

Take the set S = S & = {z G G : tr(z) ^ ±2}, then 

|S| = g 3 -2g 2 -g = g 3 (l-0(l/g)), 

satisfying condition (i). 

In order to prove condition (ii) it is enough to show that for any z G G with tr(,z) = a^ ±2, 

17-^)1 =g 3 (l + e), 

where e — > as q — > oo. 
It is well-known that 

\r-\a)\=q 2 (l + e 1 (q)), (7.3) 

where |ei(g)| < ^ (see, for example |Do] ) . 

On the other hand, a = p n op 1 o n. Let us estimate |a _1 (a)|. 

Lemma 7.2. Let p = p' o n. Then there are constants M\ and M 2 such that for every 

(1) Ift 2 ^4 ands^t 2 -2 i/ien |z» _1 (s,t)l = g 4 (l + e 2 ), w/iere |e 2 | < ^; 

(2) //t 2 = 4 &en |p _1 (s,t)| < M 2 q 4 . 

Proof. We use the notation of Proposition 13.41 

(1). Assume that t 2 ^ 4 and s ^ t 2 — 2. According to Case 2 of Proposition 13.41 

\p'- 1 (s,t)\ = \C Stt (¥ q )\ = q±l. (7.4) 

For a point (s',u,t) G C S)t (F ? ) we shall now compute |7r _1 (s', u, t)\. 
We fix a matrix 

' t 1 N 



-1 o 

with t 2 7^ 4. Direct computation shows that (x,y t ) G n~ 1 (s', u, t) if 



X _ [ u + b-at s'-a ' (7 ' 5) 



satisfies that 

5 2 -u 2 a 2 =p(s',u,t) -2, 

where 



u 2 = t 2 - 4, 

bt s' 

(T = a ~2~r ( 7 - 6 ) 

s't u 2 b 

5 = -u-\ 1 . 

2 2 
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Thus, we have a conic once more, and the number of such x is therefore q ± 1. Together 
with (17. 3p and ( I7.4p one has that 

\p-\s, t)\ = (q±l)(q± l)q 2 (l + e 1 (q)) = q\l + e 2 (q)), 

where 

\e 2 \<- + ei(q) + 0(^)<-. 
q x q z/ q 

(2.1). Assume that t = 2. Then (see Case 2 of Proposition I3.4p . 

\CsM\ <2q, (7.7) 

where s — 2 = v 2 , and s' — u = ±v for some v &F q and any (s', u, t) e C S) t(F 9 ). 
We now consider matrices of the form 

1 r 



2/r 

A pair (x, y) G 7T~ 1 (s / , w, 2) if 

x = 

and 



1 



a b 
c s' — a 



a(s' — a) — be = 1, rc + s' = u. 

This implies that 

u — s' ±u> , a(s' — a) — 
c = = , b = 



r r c 

Hence for a fixed y r there are at most 2q possible matrices x defined by the value of a and 
by the sign of c. Together with (17. 3p we get 

|7r-V,u,2)| <2q(q 2 + q). 

It follows from (17. 7p that 

IF 1 (.8,2)\ < 2q(q 2 + q)2q < 5g 4 . (7.8) 
(2.2). Assume that t = —2. Similarly to (2.1) we get 

2)| < 2q(q 2 + q)2q < 5q 4 . (7.9) 

To complete the proof we may take Mi = 4 and M 2 = 10. □ 



We proceed with the proof of the Proposition. By Theorem 15.11 and Proposition 15. 5[ the 
fiber R a = p^ : (a) of p n is isomorphic to a general fiber of X a _ 2 = r ^ 1 (o — 2) and is a curve 
of genus g n < G n , where the bound G n depends only on n. Moreover, it has at most 2 n 
points at infinity and 2 • 2 n points with t 2 = 4. It does not have points of the form (t 2 — 2,t), 
since p(t 2 -2,t) = (2,t), which is is a fixed point. 

Let A = R a n {t 2 ^ 4} and B = R a n {t 2 = 4}. 

According to the Weil estimate, 

\A(¥ q )\=q(l + e 3 (n,q)), (7.10) 

where 

l + 2^-G n + 3-2 n 

\^3{n,q)\ < . 

Q 

Hence, according to Lemma \7. 21 (1). 

\p-\A)(¥ q )\ = q(l + e 3 (n, q))q\l + e 2 ) := q\l + e 4 (n, q)), (7.11) 
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where 

|e 4 (n,g)| < \e 3 (n,q)\ + |e 2 | + |e 3 (n,g)| • \e 2 \ = 0{^—). 

There are at most 2 n+1 points in B, thus by Lemma [7.21 (2). 

\p-\B)(¥ g )\ <2" +1 g 4 M 2 . (7.12) 

Therefore, 

\a-\a)\ = g 5 (l + e 5 (n,g)), (7.13) 



where 



2 n+1 ilf 1 
|e 5 (n,g)| < \e 4 (n,q)\ + 2 - = 0(—). 



Finally, from (17. 2p and (" 17. 1 3[) we obtain 



9 '( 1 + o(-L) 



|t-i(o)| g 2 (l + ei (g)) 
as needed. □ 



We shall now show that Proposition 17.11 implies that the n-th Engel word map is also 
almost equidistributed for the family of groups PSL(2,g), where q is odd. 

Denote by g the image of g G G = SL(2,g) in G = PSL(2,g). Since q is odd, one may 
identify g with the pair {±g}. 

Let S' = {g E G : g G S and -g G S} C S. Then, by Proposition O^, < |G|(l-2e). 
Hence, if S' is the image of the set S' in G = PSL(2, q), then 

< |G|(l-2e). 

For g G G = PSL(2, q), denote 

^n(^) = {(x,y) E G x G : e n (x, y) = g}. 
Observe that for any x,y G G, 

e n (x, y) = e n (-x, y) = e n (x, -y) = e n (-x, -y), 

thus 

4-E n {g) = E n (g)UE n (-g), 

(this is a disjoint union) and so 

\E n (g)\ = \E n (g)\ + \E n (-g)\ 
\G\ ~ 2-\G\ 

Therefore, by Proposition 17.1 K m), for any g G S' one has that 

(1 - e)\G\ < E n (g) < (1 + e)\G\, 

completing the proof of Theorem O 
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8. Concluding remarks 

The trace map is an efficient way to translate an Algebraic word problem on PSL(2, q) to 
the language of Geometry and Dynamics, which has already been used fruitfully in jBGKj. 
We use it in this paper for studying the Engel words, but, actually, the same could be done 
for any other word with the same dynamical properties. Thus, one may ask the following 
questions: 

Question 8.1. What are the words for which the corresponding trace map ip(s,u,t) = 
(/i(s,M,t), f2(s,u,t),t) has the following property (*) for almost all q: 

(*) For every a G ¥ q the set {fi(s,u,t) = a} is an absolutely irreducible affine set. 

Question 8.2. What are the words for which the trace map ip(s, u, t) = (fi(s,u,t), f 2 (s,u,t),t) 
has an invariant plane A and the curves {ip 1= a} are absolutely irreducible for a general 
a G F q and for almost all ql 

We believe that these two Questions are closely related to the following variant of Shalev's 
Conjecture 11.11 

Conjecture 8.3 (Shalev). Assume that w = w(x,y) is not a power word, namely, it is not 
of the form v(x, y) m for some v G F 2 and m G N. Then w(G) = G for G = PSL(2, q). 

One can moreover ask these questions for finite simple non-abelian groups in general. 

Question 8.4. What is an analogue of the trace map for other finite simple non-abelian 
groups? In particular, for the Suzuki groups Sz(g)? (see [BGK, §4]). 

Another interesting question is related to the explicit estimates for q in Proposition 15.51 
The genus of the curve X given there is very large, and this leads to an exponential bound 
for q, as a function of n, for which X(¥ q ) ^ 0. On the other hand, computer experiments 
using Magma demonstrate that this estimate should be at most polynomial. It would be 
very interesting to investigate X and to understand this phenomenon. 
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